Sampling near neighbors in search for fairness
نویسندگان
چکیده
Similarity search is a fundamental algorithmic primitive, widely used in many computer science disciplines. Given set of points S and radius parameter r > 0, the -near neighbor ( -NN) problem asks for data structure that, given any query point q , returns p within distance at most from q. In this paper, we study -NN light individual fairness providing equal opportunities: all that are should have same probability to be returned. The special interest high dimensions, where Locality Sensitive Hashing (LSH), theoretically leading approach similarity search, does not provide guarantee. work, show LSH-based algorithms can made fair, without significant loss efficiency. We propose several efficient structures exact approximate variants fair NN problem. Our works more generally sampling uniformly sub-collection sets collection few other applications. also carried out an experimental evaluation highlights inherent unfairness existing structures.
منابع مشابه
the search for the self in becketts theatre: waiting for godot and endgame
this thesis is based upon the works of samuel beckett. one of the greatest writers of contemporary literature. here, i have tried to focus on one of the main themes in becketts works: the search for the real "me" or the real self, which is not only a problem to be solved for beckett man but also for each of us. i have tried to show becketts techniques in approaching this unattainable goal, base...
15 صفحه اولNear Fairness in Matroids
This article deals with the fair allocation of indivisible goods and its generalization to matroids. The notions of fairness under consideration are equitability, proportionality and envy-freeness. It is long known that some instances fail to admit a fair allocation. However, an almost fair solution may exist if an appropriate relaxation of the fairness condition is adopted. This article deals ...
متن کاملNear-optimal sample compression for nearest neighbors
We present the first sample compression algorithm for nearest neighbors with nontrivial performance guarantees. We complement these guarantees by demonstrating almost matching hardness lower bounds, which show that our bound is nearly optimal. Our result yields new insight into margin-based nearest neighbor classification in metric spaces and allows us to significantly sharpen and simplify exis...
متن کاملHypercube LSH for Approximate near Neighbors
A celebrated technique for finding near neighbors for the angular distance involves using a set of random hyperplanes to partition the space into hash regions [Charikar, STOC 2002]. Experiments later showed that using a set of orthogonal hyperplanes, thereby partitioning the space into the Voronoi regions induced by a hypercube, leads to even better results [Terasawa and Tanaka, WADS 2007]. How...
متن کاملQuantitative Analysis of Nearest-Neighbors Search in High-Dimensional Sampling-Based Motion Planning
We quantitatively analyze the performance of exact and approximate nearest-neighbors algorithms on increasingly high-dimensional problems in the context of sampling-based motion planning. We study the impact of the dimension, number of samples, distance metrics, and sampling schemes on the efficiency and accuracy of nearest-neighbors algorithms. Efficiency measures computation time and accuracy...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Communications of The ACM
سال: 2022
ISSN: ['1557-7317', '0001-0782']
DOI: https://doi.org/10.1145/3543667